Optimal rates for independence testing via U-statistic permutation tests
نویسندگان
چکیده
We study the problem of independence testing given independent and identically distributed pairs taking values in a ?-finite, separable measure space. Defining natural dependence D(f) as squared L2-distance between joint density f product its marginals, we first show that there is no valid test uniformly consistent against alternatives form {f:D(f)??2}. therefore restrict attention to impose additional Sobolev-type smoothness constraints, define permutation based on basis expansion U-statistic estimator prove minimax optimal terms separation rates many instances. Finally, for case Fourier [0,1]2, provide an approximation power function offers several insights. Our methodology implemented R package USP.
منابع مشابه
On the Permutation Distribution of Independence Tests
One of the most popular class of tests for independence between two random variables is the general class of rank statistics which are invariant under permutations. This class contains Spearman’s coefficient of rank correlation statistic, Fisher-Yates statistic, weighted Mann statistic and others. Under the null hypothesis of independence these test statistics have a permutation distribution th...
متن کاملBootstrap and permutation tests of independence for point processes
Motivated by a neuroscience question about synchrony detection in spike train analysis, we deal with the independence testing problem for point processes. We introduce non-parametric test statistics, which are rescaled general U -statistics, whose corresponding critical values are constructed from bootstrap and randomization/permutation approaches, making as few assumptions as possible on the u...
متن کاملNonparametric Optimal Tests for Independence in the Elliptical VAR Model
In this work we construct a class of locally asymptotically most stringent (in the Le Cam sense) tests for independence between two sets of variables in the VAR models. These tests are based on multivariate ranks of distances and multivariate signs of the observations and are shown to be asymptotically distribution-free under very mild assumptions on the noise, which is obtained by applying a l...
متن کاملPermutation Tests for Classification
We introduce and explore an approach to estimating statistical significance of classification accuracy, which is particularly useful in scientific applications of machine learning where high dimensionality of the data and the small number of training examples render most standard convergence bounds too loose to yield a meaningful guarantee of the generalization ability of the classifier. Instea...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Annals of Statistics
سال: 2021
ISSN: ['0090-5364', '2168-8966']
DOI: https://doi.org/10.1214/20-aos2041